Integrated cTAKES for Concept Mention Detection and Normalization
نویسندگان
چکیده
We participated Task 1 using an existing system MedTagger implemented in integrated cTAKES (icTAKES). The concept mention detection is based on Conditional Random Fields (CRF) and the concept mention normalization is based on a greedy dictionary lookup algorithm. A distinctive feature in MedTagger compared to other concept mention detection systems is the incorporation of dictionary lookup results into a machine learning framework for sequential labeling. Dictionary lookup results of MedLex and semantic vectors representing distributed semantics were used as features. Overall, the precision, recall, and F-measure of our best run for concept mention are 0.8, 0.573, and 0.668 respectively for strict evaluation and 0.939, 0.766, and 0.844 for relaxed evaluation. The accuracy of our best run for concept mention normalization is 54.6% and 87.0% for strict and relaxed mapping, respectively.
منابع مشابه
QuickUMLS: a fast, unsupervised approach for medical concept extraction
Entity extraction is a fundamental step in many health informatics systems. In recent years, tools such as MetaMap and cTAKES have been widely used for medical concept extraction on medical literature and clinical notes; however, relatively little interest has been placed on their scalability to large datasets. In this work, we present QuickUMLS: a fast, unsupervised, approximate dictionary mat...
متن کاملTeamUEvora at Clef eHealth 2014 Task2a
We present our first participation in a ShARe/CLEF eHealth Lab contributing for task 2a. Task 2 is an extension of the 2013 lab task 1 and consists of information extraction from clinical texts for Disease/Disorder Template Filling; task 2a aims at predicting each attribute’s normalization value. This work constitutes a preliminary approach to the problem of extracting and handling information ...
متن کاملMayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications
We aim to build and evaluate an open-source natural language processing system for information extraction from electronic medical record clinical free-text. We describe and evaluate our system, the clinical Text Analysis and Knowledge Extraction System (cTAKES), released open-source at http://www.ohnlp.org. The cTAKES builds on existing open-source technologies-the Unstructured Information Mana...
متن کاملMedical Disorder Recognition with Structural Support Vector Machines
In this paper we present two systems that address the issues of disorder recognition and normalization submitted by the authors as defined by the CLEF/ShARe Evaluation Lab. The first approach to the tasks formed a baseline approach using the cTakes system. Our second approach leveraged Structural Support Vector Machines with an array of feature types including lexical, semantic and cluster base...
متن کاملCUAB: Supervised Learning of Disorders and their Attributes using Relations
We implemented an end-to-end system for disorder identification and slot filling. For identifying spans for both disorders and their attributes, we used a linear chain conditional random field (CRF) approach coupled with cTAKES for pre-processing. For combining disjoint disorder spans, finding relations between attributes and disorders, and attribute normalization, we used l2-regularized l2-los...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013